AITopics | satisfaction rating

Collaborating Authors

satisfaction rating

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

RLHS: Mitigating Misalignment in RLHF with Hindsight Simulation

Liang, Kaiqu, Hu, Haimin, Liu, Ryan, Griffiths, Thomas L., Fisac, Jaime Fernández

arXiv.org Artificial IntelligenceJan-15-2025

Generative AI systems like foundation models (FMs) must align well with human values to ensure their behavior is helpful and trustworthy. While Reinforcement Learning from Human Feedback (RLHF) has shown promise for optimizing model performance using human judgments, existing RLHF pipelines predominantly rely on immediate feedback, which can fail to accurately reflect the downstream impact of an interaction on users' utility. We demonstrate that feedback based on evaluators' foresight estimates of downstream consequences systematically induces Goodhart's Law dynamics, incentivizing misaligned behaviors like sycophancy and deception and ultimately degrading user outcomes. To alleviate this, we propose decoupling evaluation from prediction by refocusing RLHF on hindsight feedback. Our theoretical analysis reveals that conditioning evaluator feedback on downstream observations mitigates misalignment and improves expected human utility, even when these observations are simulated by the AI system itself. To leverage this insight in a practical alignment algorithm, we introduce Reinforcement Learning from Hindsight Simulation (RLHS), which first simulates plausible consequences and then elicits feedback to assess what behaviors were genuinely beneficial in hindsight. We apply RLHS to two widely-employed online and offline preference optimization methods -- Proximal Policy Optimization (PPO) and Direct Preference Optimization (DPO) -- and show empirically that misalignment is significantly reduced with both methods. Through an online human user study, we show that RLHS consistently outperforms RLHF in helping users achieve their goals and earns higher satisfaction ratings, despite being trained solely with simulated hindsight feedback. These results underscore the importance of focusing on long-term consequences, even simulated ones, to mitigate misalignment in RLHF.

arxiv preprint arxiv, information, requirement, (15 more...)

arXiv.org Artificial Intelligence

2501.08617

Country:

North America > United States > New York (0.04)
Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)
Questionnaire & Opinion Survey (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.34)

Add feedback

AI: Workers need more protection - TUC union

BBC NewsJul-19-2023, 23:05:41 GMT

An email written by our team members has a 65% satisfaction rating from customers,

protection, satisfaction rating, tuc union, (2 more...)

BBC News

Industry: Media > News (0.40)

Technology: Information Technology > Artificial Intelligence (0.49)

Add feedback

Alexa, Let's Work Together: Introducing the First Alexa Prize TaskBot Challenge on Conversational Task Assistance

Gottardi, Anna, Ipek, Osman, Castellucci, Giuseppe, Hu, Shui, Vaz, Lavina, Lu, Yao, Khatri, Anju, Chadha, Anjali, Zhang, Desheng, Sahai, Sattvik, Dwivedi, Prerna, Shi, Hangjie, Hu, Lucy, Huang, Andy, Dai, Luke, Yang, Bofei, Somani, Varun, Rajan, Pankaj, Rezac, Ron, Johnston, Michael, Stiff, Savanna, Ball, Leslie, Carmel, David, Liu, Yang, Hakkani-Tur, Dilek, Rokhlenko, Oleg, Bland, Kate, Agichtein, Eugene, Ghanadan, Reza, Maarek, Yoelle

arXiv.org Artificial IntelligenceSep-13-2022

Since its inception in 2016, the Alexa Prize program has enabled hundreds of university students to explore and compete to develop conversational agents through the SocialBot Grand Challenge. The goal of the challenge is to build agents capable of conversing coherently and engagingly with humans on popular topics for 20 minutes, while achieving an average rating of at least 4.0/5.0. However, as conversational agents attempt to assist users with increasingly complex tasks, new conversational AI techniques and evaluation platforms are needed. The Alexa Prize TaskBot challenge, established in 2021, builds on the success of the SocialBot challenge by introducing the requirements of interactively assisting humans with real-world Cooking and Do-It-Yourself tasks, while making use of both voice and visual modalities. This challenge requires the TaskBots to identify and understand the user's need, identify and integrate task and domain knowledge into the interaction, and develop new ways of engaging the user without distracting them from the task at hand, among other challenges. This paper provides an overview of the TaskBot challenge, describes the infrastructure support provided to the teams with the CoBot Toolkit, and summarizes the approaches the participating teams took to overcome the research challenges.

artificial intelligence, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2209.06321

Country:

North America > United States (0.04)
Asia > China > Hong Kong (0.04)

Genre:

Overview (1.00)
Workflow (0.93)

Industry:

Consumer Products & Services (0.47)
Education (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Joint Turn and Dialogue level User Satisfaction Estimation on Multi-Domain Conversations

Bodigutla, Praveen Kumar, Tiwari, Aditya, Vargas, Josep Valls, Polymenakos, Lazaros, Matsoukas, Spyros

arXiv.org Artificial IntelligenceOct-8-2020

Dialogue level quality estimation is vital for optimizing data driven dialogue management. Current automated methods to estimate turn and dialogue level user satisfaction employ hand-crafted features and rely on complex annotation schemes, which reduce the generalizability of the trained models. We propose a novel user satisfaction estimation approach which minimizes an adaptive multi-task loss function in order to jointly predict turn-level Response Quality labels provided by experts and explicit dialogue-level ratings provided by end users. The proposed BiLSTM based deep neural net model automatically weighs each turn's contribution towards the estimated dialogue-level rating, implicitly encodes temporal dependencies, and removes the need to hand-craft features. On dialogues sampled from 28 Alexa domains, two dialogue systems and three user groups, the joint dialogue-level satisfaction estimation model achieved up to an absolute 27% (0.43->0.70) and 7% (0.63->0.70) improvement in linear correlation performance over baseline deep neural net and benchmark Gradient boosting regression models, respectively.

artificial intelligence, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2010.02495

Country:

North America > United States > New York > New York County > New York City (0.04)
Asia > Singapore (0.04)
North America > United States > New York > Monroe County > Rochester (0.04)
(6 more...)

Genre: Research Report (1.00)

Industry: Media (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)

Add feedback

An empirical study of computing with words approaches for multi-person and single-person systems

Gupta, Prashant K, Muhuri, Pranab K.

arXiv.org Artificial IntelligenceApr-30-2020

Computing with words (CWW) has emerged as a powerful tool for processing the linguistic information, especially the one generated by human beings. Various CWW approaches have emerged since the inception of CWW, such as perceptual computing, extension principle based CWW approach, symbolic method based CWW approach, and 2-tuple based CWW approach. Furthermore, perceptual computing can use interval approach (IA), enhanced interval approach (EIA), or Hao-Mendel approach (HMA), for data processing. There have been numerous works in which HMA was shown to be better at word modelling than EIA, and EIA better than IA. But, a deeper study of these works reveals that HMA captures lesser fuzziness than the EIA or IA. Thus, we feel that EIA is more suited for word modelling in multi-person systems and HMA for single-person systems (as EIA is an improvement over IA). Furthermore, another set of works, compared the performances perceptual computing to the other above said CWW approaches. In all these works, perceptual computing was shown to be better than other CWW approaches. However, none of the works tried to investigate the reason behind this observed better performance of perceptual computing. Also, no comparison has been performed for scenarios where the inputs are differentially weighted. Thus, the aim of this work is to empirically establish that EIA is suitable for multi-person systems and HMA for single-person systems. Another dimension of this work is also to empirically prove that perceptual computing gives better performance than other CWW approaches based on extension principle, symbolic method and 2-tuple especially in scenarios where inputs are differentially weighted.

cww approach, extension principle, frequency, (14 more...)

arXiv.org Artificial Intelligence

2004.14892

Country:

North America > United States (1.00)
Asia > India > NCT > New Delhi (0.04)
Asia > India > NCT > Delhi (0.04)

Genre: Research Report (0.49)

Industry:

Government > Regional Government > North America Government > United States Government (1.00)
Energy (1.00)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Fuzzy Logic (0.46)

Add feedback

Multi-domain Conversation Quality Evaluation via User Satisfaction Estimation

Bodigutla, Praveen Kumar, Polymenakos, Lazaros, Matsoukas, Spyros

arXiv.org Machine LearningNov-17-2019

An automated metric to evaluate dialogue quality is vital for optimizing data driven dialogue management. The common approach of relying on explicit user feedback during a conversation is intrusive and sparse. Current models to estimate user satisfaction use limited feature sets and employ annotation schemes with limited generalizability to conversations spanning multiple domains. To address these gaps, we created a new Response Quality annotation scheme, introduced five new domain-independent feature sets and experimented with six machine learning models to estimate User Satisfaction at both turn and dialogue level. Response Quality ratings achieved significantly high correlation (0.76) with explicit turn-level user ratings. Using the new feature sets we introduced, Gradient Boosting Regression model achieved best (rating [1-5]) prediction performance on 26 seen (linear correlation ~0.79) and one new multi-turn domain (linear correlation 0.67). We observed a 16% relative improvement (68% -> 79%) in binary ("satisfactory/dissatisfactory") class prediction accuracy of a domain-independent dialogue-level satisfaction estimation model after including predicted turn-level satisfaction ratings as features.

annotator, estimation model, satisfaction rating, (12 more...)

arXiv.org Machine Learning

1911.08567

Country:

South America > Paraguay > Asunción > Asunción (0.04)
North America > Canada (0.04)
Europe > United Kingdom > England > Greater London > London (0.04)
(4 more...)

Genre: Research Report (0.82)

Industry:

Media > Film (0.93)
Leisure & Entertainment (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.88)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.35)

Add feedback

Domain-Independent turn-level Dialogue Quality Evaluation via User Satisfaction Estimation

Bodigutla, Praveen Kumar, Wang, Longshaokan, Ridgeway, Kate, Levy, Joshua, Joshi, Swanand, Geramifard, Alborz, Matsoukas, Spyros

arXiv.org Artificial IntelligenceAug-19-2019

An automated metric to evaluate dialogue quality is vital for optimizing data driven dialogue management. The common approach of relying on explicit user feedback during a conversation is intrusive and sparse. Current models to estimate user satisfaction use limited feature sets and rely on annotation schemes with low inter-rater reliability, limiting generalizability to conversations spanning multiple domains. To address these gaps, we created a new Response Quality annotation scheme, based on which we developed turn-level User Satisfaction metric. We introduced five new domain-independent feature sets and experimented with six machine learning models to estimate the new satisfaction metric. Using Response Quality annotation scheme, across randomly sampled single and multi-turn conversations from 26 domains, we achieved high inter-annotator agreement (Spearman's rho 0.94). The Response Quality labels were highly correlated (0.76) with explicit turn-level user ratings. Gradient boosting regression achieved best correlation of ~0.79 between predicted and annotated user satisfaction labels. Multi Layer Perceptron and Gradient Boosting regression models generalized to an unseen domain better (linear correlation 0.67) than other models. Finally, our ablation study verified that our novel features significantly improved model performance.

artificial intelligence, machine learning, natural language, (16 more...)

arXiv.org Artificial Intelligence

1908.07064

Country:

Asia > Singapore (0.14)
Asia > Middle East > Republic of Türkiye (0.14)

Genre: Research Report (0.65)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.89)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.89)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Perceptrons (0.55)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.48)

Add feedback

An Introduction to Machine Learning Theory and Its Applications

#artificialintelligenceJan-31-2017, 01:25:05 GMT

The supply of able ML designers has yet to catch up to this demand. A major reason for this is that ML is just plain tricky. This tutorial introduces the basics of Machine Learning theory, laying down the common themes and concepts, making it easy to follow the logic and get comfortable with the topic. So what exactly is "machine learning" anyway? ML is actually a lot of things. The field is quite vast and is expanding rapidly, being continually partitioned and sub-partitioned ad nauseam into different sub-specialties and types of machine learning.

artificial intelligence, machine learning, predictor, (17 more...)

#artificialintelligence

Genre: Instructional Material > Course Syllabus & Notes (0.89)

Industry: Banking & Finance (0.47)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Computational Learning Theory (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.95)

Add feedback

An Introduction to Machine Learning Theory and Its Applications: A Visual Tutorial with Examples

#artificialintelligenceJan-3-2017, 18:20:40 GMT

Machine Learning (ML) is coming into its own, with a growing recognition that ML can play a key role in a wide range of critical applications, such as data mining, natural language processing, image recognition, and expert systems. ML provides potential solutions in all these domains and more, and is set to be a pillar of our future civilization. The supply of able ML designers has yet to catch up to this demand. A major reason for this is that ML is just plain tricky. This tutorial introduces the basics of Machine Learning theory, laying down the common themes and concepts, making it easy to follow the logic and get comfortable with the topic. So what exactly is "machine learning" anyway?

artificial intelligence, machine learning, predictor, (15 more...)

#artificialintelligence

Genre: Instructional Material > Course Syllabus & Notes (0.89)

Industry: Banking & Finance (0.47)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Computational Learning Theory (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.95)

Add feedback

An Introduction to Machine Learning Theory and Its Applications: A Visual Tutorial with Examples

#artificialintelligenceDec-11-2016, 12:15:37 GMT

No discussion of ML would be complete without at least mentioning neural networks. Not only do neural nets offer an extremely powerful tool to solve very tough problems, but they also offer fascinating hints at the workings of our own brains, and intriguing possibilities for one day creating truly intelligent machines. Neural networks are well suited to machine learning problems where the number of inputs is gigantic. The computational cost of handling such a problem is just too overwhelming for the types of systems we've discussed above. As it turns out, however, neural networks can be effectively tuned using techniques that are strikingly similar to gradient descent in principle. A thorough discussion of neural networks is beyond the scope of this tutorial, but I recommend checking out our previous post on the subject.

artificial intelligence, machine learning, predictor, (15 more...)

#artificialintelligence

Genre: Instructional Material (0.47)

Industry:

Banking & Finance (0.47)
Education (0.35)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback